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We have searched for D° — D mixing in D* + -> tv + D° decays with D° ->■ K { * ] ev in a sample of 
e + e~ — * cc events produced near 10.58 GeV. The charge of the slow pion from charged D* decay 
tags the charm flavor at production, and it is required to be consistent with the flavor of a fully 
reconstructed second charm decay in the same event. We observe 3 mixed candidates compared 
to 2.85 background events expected from simulation. We ascribe a 50% systematic uncertainty to 



this expected background rate. We find a central value for the mixing rate of 0.4 x 10~ 4 . Using a 
frequentist method, we set corresponding 68% and 90% confidence intervals at (—5.6, 7.4) x 10~ 4 
and (—13, 12) x 10 -4 , respectively. 

PACS numbers: 13.20.Fc, 12.15.Ff 



I. INTRODUCTION 

The D° and D° mesons are flavor cigenstates which are 
invariant in strong interactions, but are subject to elec- 
troweak interactions that permit an initial flavor eigen- 
state to evolve into a timc-dependent mixture of D° 
and D°. In the Standard Model (SM), such oscillations 
proceed through both short-distance and long-distance, 
non-perturbative amplitudes. The expected mixing rate 
mediated by down-type quark box diagrams [l[ and di- 
penguin 0] diagrams is 0(1O~ 8 — 10~ 10 ), well below the 
current experimental sensitivity of 0(1O -4 — 10 -3 ) [||. 
The predicted range for non-perturbative, long-distance 
contributions Q is approximately bounded by the box 
diagram rate and the current experimental sensitivity. 
New physics predictions span the same large range [J]. 
While the presence of a mixing signal alone would not be 
a clear indication of new physics, the current experimen- 
tal bounds already constrain many new physics models. 

Because D° — D° mixing has been considered a po- 
tential signature for new physics, and because CP vi- 
olation in such mixing would be a signature for new 
physics, there have been many searches for D° — D° 
mixing. Typically, these searches use samples of neu- 
tral D mesons produced as decay products of charged 
D* mesons where the charge of the slow pion (tt s ) pro- 
duced in association with the neutral D meson tags the 
production flavor of the neutral D meson. In semilep- 
tonic decays (D° — > K^*'ev), the flavor of the neutral 
D meson when it decays is uniquely identified by the 
charge of the lepton. The signs of the slow pion and lep- 
ton charges are the same for unmixed decays and they 
differ for mixed decays. Historically, these two classes 
of decays are denoted as right-sign (RS) and wrong-sign 
(WS), respectively. 

The B-factory experiments have searched for D° — D° 
mixing using semileptonic (SL) decays, where the initial 
flavor of the neutral D meson is tagged by the charge of 
the slow pion from a D*^ decay. The limits on r m i x (de- 
fined below) from these experiments are listed in Ta- 
ble HI along with those from recently published searches 
for D° — D° mixing using hadronic decay modes @, B Q • 
In the earlier BaBAR SL analysis [j| , the dominant source 
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of background in the WS signal channel originated from 
RS SL D° decays falsely associated with WS slow pion 
candidates. In this analysis we tag the initial flavor of 
the neutral D meson twice: once using the slow pion 
from the charged D* decay from which the neutral D 
decays semileptonically, and once using the flavor of a 
high-momentum D fully reconstructed in the center-of- 
mass (CM) hemisphere opposite the semileptonic can- 
didate. Tagging the flavor at production twice, rather 
than once, highly suppresses the background from false 
WS slow pions but also reduces the signal by more than 
an order of magnitude. The BABAR collaboration has pre- 
viously used this tagging technique in a measurement of 
the pseudoscalar decay constant fo s [HI- We have im- 
plemented additional candidate selection criteria to min- 
imize remaining sources of background; the sensitivity 
of this double-tag analysis is estimated to be about the 
same as that of a corresponding single-tag semileptonic 
analysis for the same dataset. 

Charm mixing is generally characterized by two dimcn- 
sionless parameters, x = Am/T and y = AL/2L, where 
Am = ni2 — mi (AT = T2 — Ti) is the mass (width) dif- 
ference between the two neutral D mass eigenstates and 
r is the average width. If either x or y is non-zero, then 
D°-D° mixing will occur. The decay time distribution 
of a neutral D meson which changes flavor and decays 
semileptonically, and thus involves no doubly interfering 
Cabibbo-suppressed (DCS) amplitudes, is [111 ] : 

x 2 _|_ y 2 / ( \ 2 

Rmix(t) = -Runmix(i) ~. , (1) 

4 V T D a J 

where t is the proper time of the D° decay, Tr>o is the 
characteristic D° lifetime (= 1/L), i? U nmix(0 °c e - */ 7 "^ , 
and the approximation is valid in the limit of small mix- 
ing rates. Sensitivity to x and y individually is lost with 
semileptonic final states. The time-integrated mixing 
rate r m ; x relative to the unmixed rate is 



II. THE BABAR DETECTOR AND DATASET 

The data used in this analysis were collected with the 
BABAR detector [12] at the PEP-II storage ring. The in- 
tegrated luminosity used here is approximately 344 fb -1 , 
including running both at and just below the T(4S") res- 
onance. Charged-particle momenta are measured in a 
tracking system consisting of a five-layer double-sided 
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TABLE I: Limits for r m j x from earlier measurements in e + e experiments. 

Experiment Decay Mode Integrated Luminosity Upper Limit 

BABAR [5] D° K^'e+v 87 fb" 1 < 42 x 10" 4 (90% CL) 

Belle [6] D° K M ~e+v 254 fb" 1 < 10 x 10~ 4 (90% CL) 

CLEO [7] D°^A"g7r-7r+ Oft)" 1 < 63 x 10" 4 (95% CL) 

Belle [8] D° -> Jf+Tr" 400 fb" 1 < 4.0 x 10" 4 (95% CL) 

BABAR [9] D° -> AT+Tr'TT 230 fb" 1 < 5.4 x 10~ 4 (95% CL) 



silicon vertex tracker (SVT) and a 40-layer central drift 
chamber (DCH), both situated in a 1.5-T axial magnetic 
field. An internally reflecting ring-imaging Cherenkov 
detector (DIRC) with fused silica bar radiators provides 
charged-particle identification. A CsI(Tl) electromag- 
netic calorimeter (EMC) is used to detect and identify 
photons and electrons and measure their energies. Muons 
are identified in the instrumented flux return system 
(IFR). 

Electron candidates are identified by the ratio of the 
energy deposited in the EMC to the measured track 
momentum, the shower shape, the specific ionization 
measured in the DCH, and the Cherenkov angle mea- 
sured by the DIRC. Electron identification efficiency is 
greater than 90% at all momenta of interest here. Pion- 
as-electron misidentification rates increase from about 
0.05% to 0.15% from 500McV/c to 3 GcV/c. Kaon-as- 
electron misidentification rates peak at about 2.5% near 
500MeV/c and decrease to 0.2% above 800MeV/c. 

Kaon candidates are selected using the specific ioniza- 
tion (dE/dx) measured in the DCH and SVT, and the 
Cherenkov angle measured in the DIRC. Kaon identifi- 
cation efficiency is a function of laboratory momentum; 
it is typically 80% or higher over the range 500MeV/c 
to 3.5 GeV/c, with a maximum of about 90% at 2 GeV/c. 
The pion-as-kaon misidentification rate is typically 1% or 
less for momenta below 2 GeV/ c, rising to about 5% at 
3.5 GeV/c. 



III. ANALYSIS 

The initial selection of semileptonic decay candidates 
follows the single-tag analysis described in Ref. [f|. For 
each D* + — > D°ir + ;D° — > K^ev candidate (charge con- 
jugation is implied in all signal and tagging modes), we 
calculate the D* + —D° mass difference AM = m(Kew) — 
m(Ke) and the proper lifetime, as well as the output of 
an event selection neural network (NN). We then require 
that a high-momentum D decaying hadronically be fully 
reconstructed in the opposite hemisphere of the event. 
This ensures that the underlying production mechanism 
is e + e~ — > cc and provides a second production flavor 
tag. We implement additional candidate selection crite- 
ria based on studies of alternate background samples in 
data and a Monte Carlo (MC) simulated event sample 
(the "tuning" sample) to reduce various sources of back- 



ground. The quark fragmentation in e + e~ — ► cc MC 
events is simulated using JETSE T 1131 . the detector re- 
sponse is simulated via GEANT4 [14[ , and the resulting 
events are reconstructed in the same way as are real data. 

To minimize bias, we use a MC sample (the "unbi- 
ased" sample) disjoint from the tuning sample, to ob- 
tain all MC based estimates of efficiencies and back- 
grounds. We study this sample, with effective luminos- 
ity roughly equivalent to 603 fb" 1 (« 1.75 x data) only 
after all selection criteria and the full analysis method 
have been established. After the expected background 
rates are determined from the unbiased MC, we exam- 
ine the signal region in the data and determine the net 
number of observed RS and WS signal events (riRs an d 
^ws)- The measured mixing rate is then determined as 
TVnix = ^ws/^RSj corrected for the relative efficiency of 
the WS and RS signal selection criteria. 



A. Reconstruction and Selection of Semileptonic 
Signal Candidates 

Semileptonic signal candidates are selected by recon- 
structing the decay chain D* + — ► tt + D°,D — > K^ev. 
There are no essential differences for this analysis be- 
tween the K and K* — > K T ir° modes, either theoret- 
ically or empirically, and thus no attempt is made to 
reconstruct the K* — its charged K daughter is treated 
as if it were a direct daughter of the D°. Approximately 
11% of signal candidates accepted in the initial selection 
of semileptonic canidates [|[ are in the K* mode. 

Identified K and e candidates of opposite charges are 
combined to create neutral candidate D decay vertices. 
Only candidates with vertex fit probability > 0.01 and 
invariant mass < 1.82 GeV/c 2 are retained. This re- 
quirement is imposed to exclude all hadronic two-body 
D° decays. The average PEP-II interaction point (IP), 
measured on a run-to-run basis using Bhabha and 
events, is taken as the production point of D candi- 
dates. The D° decay time is measured using the trans- 
verse displacement of the D° vertex from the IP and the 
D° transverse momentum due to the relative narrowness 
of the position distribution of the IP in the transverse (r- 
(j>) plane [T^]. Neural networks are used to estimate the 
boost of signal candidates in the r-<p plane, as discussed 
below. 

The pions from D* + decays are relatively soft tracks 
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with p* < 450 MeV/c, where the asterisk denotes a 
parameter measured in the e + e~ CM frame. Charged 
tracks identified as either a charged K or e candidate are 
not considered as 7r candidates. To reject poorly recon- 
structed tracks, n candidates are required to have six or 
more SVT hits, with at least 2 hits in each of the r-cj) and 
z views, and at least one hit on the inner three layers in 
each of the r-<f> and z views. Pion candidates are refit 
constraining the tracks to originate from the IP, and are 
accepted only if the refit probability is greater than 0.01. 
Pion candidates meeting the above criteria are combined 
with Ke vertex candidates to form D* + candidates. 

A reasonable estimate of a signal candidate's proper 
decay time cannot be obtained using the partial recon- 
struction described above and, therefore, the three or- 
thogonal components of the D° CM momentum vector 
are estimated with three separate JetNet 3.4 [15j neu- 
ral networks. Each NN has two hidden layers, and is 
trained and validated with a large sample (O(10 5 )) of 
simulated signal events generated separately from the 
other MC samples used in the analysis. The following 
vector inputs to the NN's are used: p*(Ke) (the mo- 
mentum of the Ke pair constrained by the vertex fit), 
p*(ir), and the event thrust vector T* (calculated using 
all charged and neutral candidates except the K and e 
candidates). For simulated signal events, the distribution 
of the difference between the true p*(D°) direction and 
the NN output direction is unbiased and Gaussian with 
a w 130 mrad. The distribution of momentum magni- 
tude differences shows an uncertainty of u p /p ~ 10% for 
a typical signal event. 

The transverse momentum of a D° candidate and the 
projections of the IP and Ke vertex loci on the r-<f> plane 
are used to calculate a candidate's proper decay time. 
The error on the decay time, calculated using only the 
errors on the IP and Ke vertex, is typically 0.8tdo, where 
r D a is the nominal mean D° lifetime. The contribu- 
tion of the p*(D°) estimator to the total decay time un- 
certainty is approximately 10% and is ignored. Poorly 
reconstructed events, with calculated decay time errors 
greater than 2t d o, are discarded, and only events with 
decay times between — \2te>q and 15t£>o are retained. 
These criteria remove about 7% of the signal decays. 

In addition to the above criteria, events are selected us- 
ing a neural network trained to distinguish prompt charm 
signal from slow pion single-tag WS background events. 
The event selector NN uses a five-element input vector: 
p* Ke ,p* n , \T*\, 9*{p* Ke ,T*), nnd6*(p* K ,p* e ) where 0(a,b) 
denotes the opening angle between the vectors a and b. 
It has a single hidden layer of nine nodes and is also 
constructed using JetNet 3.4. Figure QJa) shows the dis- 
tribution of NN output for signal candidates, RS back- 
grounds and WS backgrounds in double-tag unbiased MC 
events passing the semileptonic side event selection crite- 
ria given above. The NN output is required to be greater 
than 0.9 in order to yield the best statistical sensitivity 
to WS signal events, assuming a null mixing rate, for the 
double-tag dataset used here. 



B. The Hadronic Tagging Samples 

We use the flavor of fully reconstructed charm decays 
in the hemisphere opposite the semileptonic signal to 
additionally tag the production flavor of the semilep- 
tonic signal, and thus significantly reduce the rate of 
wrongly tagged candidates. We use five hadronic tagging 
samples. Three samples explicitly require D* + decays: 
D*+ -> D°tt+ where D° -> K~ir + , D° -> K-jt+'tt , 
and D° — > K~~ir + ir + ir~; while the other two samples 
are not related to D* + decays: D° — > K~it + and 
D + — > K~tt + tt + . Candidates from the D* + sample are 
explicitly excluded from the more inclusive D° —> K~tt + 
sample to ensure that the tagging samples are disjoint. 

The selection criteria for the tagging samples, such as 
the AM ranges for the D* modes or the use of production 
and decay vertex separation for the D + mode, vary from 
channel to channel to balance high purity against high 
statistics. Potential criteria are studied using candidate 
events from a RS sample chosen with loose requirements 
on the semileptonic side. To eliminate candidates from 
BB events, we require the CM momentum of the tag- 
side D be at least 2.5GcV/c. The individual D*+ , D° 
and D + tagging candidate invariant mass distributions 
for the final RS sample are shown in Figure [21 The puri- 
ties of these tagging samples, defined as the ratio of signal 
in the selected mass range to the total number of candi- 
dates in that range (the darkly shaded entries in each his- 
togram), vary from about 92% for D° —> K~ir + which do 
not come from D* + to 97.5% for D° -»■ K~tt + from D*+ . 
In the very few events where there are multiple hadronic 
tag candidates, we use the tag coming from the highest 
purity tagging sample. We reject events with multiple 
semileptonic signal candidates, requiring that one and 
only one candidate, whether RS or WS, be present af- 
ter all tagging and basic semileptonic side selections are 
imposed. This requirement rejects approximately 13% of 
signal candidates and a similar fraction of background 
candidates. 



C. Additional Semileptonic Side Selection Criteria 

Double-tagging the production flavor of the neutral D 
mesons effectively eliminates the WS background due to 
real semileptonic D decays paired with false slow pions 
from putative D* + decay. From studies of background 
events in the tuning MC sample, we find that kaon and 
electron candidates are almost always real kaons and elec- 
trons, respectively, with correctly assigned charges. We 
also find that many fake slow pion candidates are elec- 
trons produced as part of conversion pairs, or Dalitz de- 
cays of 7T° or, to a much lesser extent, rj mesons. These 
processes also contribute background tracks to the pool 
of electron candidates used to create Ke vertices combi- 
natorically. We consequently implement selection criteria 
to reject tracks which may have originated in such pro- 
cesses by requiring that neither an electron nor slow pion 
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FIG. 1: Event selector NN output, from double-tag luminosity-scaled unbiased MC, for events (a) passing the initial selection 
of semileptonic side events and (b) passing the additional semileptonic side selection criteria. The distributions shown are those 
for signal candidates (dashed line), RS backgrounds (solid line), and WS backgrounds (solid fill). The final event selection 
requires NN output > 0.9. 



TABLE II: Effects of additional semileptonic side selection criteria. Approximate cumulative acceptance rates in the double-tag 
unbiased MC for signal and WS background as additional selection criteria are applied, relative to acceptances following the 
initial semileptonic-side selection. The signal acceptances are the same for RS and WS signal samples except for the decay 
time cut where the entry is that for the WS sample. 



Criterion Sif 


mal Retained WS Backf 


ground Retained 


e* conversion and Dalitz pair veto 


100% 


82% 


n s dE/dx cut 


85% 


66% 


tt s pt and pl selection 


72% 


36% 


m(Ke) > 0.8GeV/c 2 


71% 


30% 


(M(Ke),AM) kinematic cut 


70% 


20% 


600 < t < 3900 fs 


55% 


10% 



candidate form a conversion pair when combined with an 
oppositely charged track treated as an electron (whether 
identified as such or not). We further require that elec- 
tron candidates not form a 7r° candidate when combined 
with a photon candidate and an oppositely charged track 
treated as an electron. (After applying all event selec- 
tions, we find no contribution in the tuning MC sample 
from rj Dalitz decays.) Rejecting photon conversions and 
Dalitz decays reduces the total RS and WS backgrounds 
by about 20% each, and has a negligible effect on signal 
efficiency. 

To reduce backgrounds from kaons misidentified as 
electrons, we require that the laboratory momentum of 
electron candidates be greater than 600 MeV/ c. This re- 
duces the signal efficiency by about 15% and the back- 
ground rate by about 35%. To further reduce the num- 



ber of electrons that are considered as slow pions, we veto 
tracks where dE / dx in the SVT is consistent with that of 
an electron. This reduces the signal efficiency by about 
15% and the background rate by about 25%. 

We study kinematic distributions that discriminate be- 
tween signal and background using data and MC events 
with two fully reconstructed hadronic decays of charm 
mesons. As a result, we require that the slow pion CM 
longitudinal momentum (along the axis defined by the 
direction opposite the tagging _D's CM momentum) lie 
in the range 150 — 400 MeV/c and its transverse momen- 
tum be less than 80 MeV/ c. This reduces the background 
by approximately 40% and the signal efficiency by ap- 
proximately 15%. 

We require that the electron-kaon invariant mass be 
greater than 800 MeV/ c 2 ; this reduces the signal efficiency 
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FIG. 2: Invariant mass distributions of hadronic tagging candidates in data events with RS semileptonic candidates passing 
the initial semileptonic-side selection. The dark shaded regions denote hadronic candidates used as tags. Sideband "1" events 
are used to characterize "false tag" rates and sideband "2" events are used for background normalization in optimizing the 
hadronic mass selection. Top row: (a) D° -> R-TT+, (b) D + K~ 7r + 7r + . Bottom row: D* + — > D°ir + events in final states 

(c) D° -> K--K+, (d) D° -> K-TT+TY + TT-, (e) D° -> A^TT+TT . 



by a few percent while it reduces the background rate by 
approximately 20%. When we count the final number 
of signal candidates, we also require that (M(Ke), AM) 
lies inside the kinematic boundary expected for D* + — > 
D°tt + ; D° — > Kev decays where the neutrino momen- 
tum is ignored. This has essentially no effect on sig- 
nal efficiency, and reduces WS backgrounds by about 
35%. The cumulative effects of the additional semilep- 
tonic side selection criteria are summarized in Table [TTJ 
(The selection for electron momentum > 600McV/c is 
applied prior to calculating the acceptances listed in the 
table.) The effects of these additional selection criteria 
on D° — > K*ev events are reasonably consistent with 
those for D° — ► Kev events. The combination of the 
slow pion longitudinal and transverse momenta and the 
(M(Ke), AM) selections will hereinafter be referred to as 
the "double-tag kinematic selection" . Figure [3] shows the 
AM distributions of signal events in unbiased MC scaled 
to the luminosity of the data both before and after im- 
posing the double-tag kinematic selection. The marginal 
efficiency resulting from applying these last selection cri- 



teria to signal events is 84 ± 1%. 

The decay time distributions of the RS and WS signals 
should differ, as shown in Equation [TJ The RS sample 
is produced with an exponential decay rate, while the 
WS sample should be produced with the same exponen- 
tial rate modulated by t 2 . Figure [4] shows the normalized 
lifetime distributions for reconstructed simulated RS and 
WS signal events passing the final tag and signal-side se- 
lection. To improve sensitivity, we select only WS candi- 
dates with measured lifetimes between 600 fs (« 1.5t£>o) 
and 3900 fs 9.5r D a ), which accepts approximately 
80% of signal and less than 30% of background. Be- 
cause the RS signal-to-background ratio is comparatively 
very large, we accept RS candidates across the full range 
shown in Figure 2J This WS/RS relative efficiency has 
a 2% systematic uncertainty due to imperfect knowledge 
of the decay time resolution function. This is determined 
from changes in the WS/RS efficiency observed when 
varying the signal resolution function according to the 
difference between resolution functions observed in RS 
data and MC samples. 
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FIG. 3: RS signal AM distribution in unbiased MC scaled 
to the luminosity of the data before (line) and after (solid) 
applying the double-tag kinematic selection. 



FIG. 5: WS AM distribution for background events passing 
the WS decay time selection in unbiased MC scaled to the 
luminosity of the data before (line) and after (solid) applying 
the double-tag kinematic selection. 



e55 




t[fs] 

FIG. 4: Normalized RS (dashed) and WS (solid) recon- 
structed simulated signal lifetime distributions. The solid ver- 
tical lines mark the range for the selection of the WS events. 



Figure QJb) shows the NN event selector output for RS 
signal, RS backgrounds and WS backgrounds in the un- 
biased MC sample passing the additional scmilcptonic 
side selection criteria (scaled to the luminosity of the 
data). The effectiveness of the additional semileptonic- 
side criteria in suppressing WS backgrounds while simul- 
taneously retaining good signal efficiency can be seen by 
comparing Figures HJa) and (b). Figure [S] shows the AM 



distribution of WS backgrounds passing the decay time 
selection in unbiased MC scaled to the luminosity of the 
data both before and after the double-tag kinematic se- 
lection. A total of 2.85 background candidates, the sum 
of the luminosity-scaled events in the solid histogram 
shown in the figure, is expected after all event selection 
criteria are applied. 

D. Measuring Signal Yields 

To determine the mixing rate, we first determine the 
number of RS signal candidates by fitting the RS AM 
distribution, as described in detail below. We then esti- 
mate the expected rate of WS background events in the 
signal region of the data from the unbiased MC sample. 
Using several background control samples drawn from 
both data and MC, we estimate how well MC events de- 
scribe real data events. Using a statistical procedure with 
good frcqucntist coverage, we combine the number of can- 
didates observed in the WS sample, the expected back- 
ground rate, and the estimated systematic uncertainty in 
the expected background rate to obtain a central value for 
the mixing rate and 68% and 90% confidence intervals. 
This procedure is described in detail in the appendix. 

We extract the number of RS signal events from the 
AM distribution of the RS sample selected without the 
double-tag kinematic selection using an extended max- 
imum likelihood fit. The likelihood function includes 
probability density functions (PDF's) for the signal, the 
background events which peak in the signal region, and 
the combinatorial background. The PDF for each event 
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FIG. 6: RS data AM distribution. The main plot shows the 
RS data (points) before imposing the double-tag kinematic 
selection, and the projections of the total fit PDF (solid line) 
and the background PDF (dashed line). The inset plot shows 
the RS AM distribution after the double-tag kinematic selec- 
tion criteria are applied. 



class is assigned using the functional forms described 
in Ref. ||. The shape parameters for the combina- 
toric background are determined using the following tech- 
nique: D° signal candidates in the data from one event 
are combined with tt s candidates from another event to 
model the shape of this PDF. Based on MC studies, the 
shape of the peaking AM background is assumed to be 
the same shape as the signal. Its relative level is also de- 
termined from MC studies. The shape parameters of the 
signal PDF, as well as the number of RS signal events 
and the number of combinatorial background events, are 
then obtained from the likelihood fit of the data. 

The main plot in Figure [5] shows the AM fit of the RS 
data before applying the double-tag kinematic selection, 
with the signal and background contributions overlaid. 
The fitted RS signal yield in this sample is 5748 ± 90 
events, with % 2 = 77 for 60 bins, where six parameters 
are determined from the fit. The inset plot of Figure [6] 
shows the RS data AM distribution after the double-tag 
kinematic selection is imposed. As noted above, the ef- 
ficiency of this selection is 0.84 ± 0.01, giving a final RS 
signal yield of 4780 ± 94, which is used as the normaliza- 
tion in calculating the mixing rate. 

To determine the number of WS mixed events, we con- 
sider three regions of AM: the signal region, AM < 
0.20GcV/c 2 ; the near background region, 0.20 < AM < 
0.25GeV/c 2 ; and the far background region, 0.25 < 
AM < 0.35GeV/c 2 . These AM ranges are shown in 
Figure and are respectively labeled "1", "2" and "3" 
in the plot. To avoid potential bias, we examine nei- 
ther the signal region nor the near background region 
in the WS data sample until all of the selection crite- 
ria and the procedure for calculating confidence inter- 
vals are determined. The WS signal region may contain 
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FIG. 7: WS data AM distribution. The dark histogram 
shows WS events in the data passing all event selection. The 
light histogram shows WS events passing all selections except 
the double-tag kinematic selection. Region "1" is the signal 
region, "2" is the near sideband, and "3" is the far sideband. 



both signal and background events after applying the fi- 
nal event selection critera. As discussed above, we de- 
termine the expected number of background events from 
the unbiased MC sample: we observe 5 events, which 
scales to 2.85 for the luminosity of the data. To estimate 
the possible non-cc background rate, we also examine 
events which satisfy the semilcptonic-sidc selection crite- 
ria but fail the tagging-side criteria because the mass of 
the hadronic D candidate falls outside the accepted win- 
dow. Since we had examined the data events in the "far" 
sidebands (sidebands "2" ) of Figure [2] while optimizing 
hadronic side selection criteria, we also examine those 
in the "near sidebands (sidebands "1") to estimate the 
number of these "false tag" events: we find no WS can- 
didates in the near or far AM sideband regions in either 
the data or unbiased MC sample. Given the agreement 
between data and the unbiased MC sample, we deter- 
mine the central value of the number of WS signal events 
by subtracting the luminosity-scaled number of unbiased 
MC WS background events in the signal region from the 
number of candidates observed in the data there. 

The dark shaded entries in Figure [7] denote the AM 
distribution of WS candidates in the data after all event 
selection, where we observe 3 WS candidates in the sig- 
nal region and none in the sideband regions. Given the 
expected WS background of 2.85 events shown in the 
solid histogram of Figure [51 we calculate a net WS signal 
yield of 0.15 events. We discuss below the total error as- 
sociated with the estimated number of WS background 
events. 
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E. Systematics and Confidence Intervals 

To calculate confidence intervals for the number of 
mixed events observed, we first determine a systematic 
uncertainty associated with the WS background esti- 
mate. To do this, we compare 10 background control 
samples in data with the corresponding MC samples. 
The results of this comparison are shown in Table IIIII 
The first line compares the number of WS events ob- 
served in the far background region of the data and the 
tuning MC sample. The second line compares the same 
numbers for the data and for the unbiased MC sam- 
ple. The remaining table entries compare the number 
of events observed in two types of doubly-charged (DC) 
background samples obtained from data with those ob- 
served from the same sources in unbiased MC events. In 
both of the DC background samples, the kaon and the 
electron have the same charge sign, and are reconstructed 
exactly as neutral K e vertex candidates are, except for 
the differing charge correlation. In those additionally la- 
beled WS, the slow pion has the same charge as the kaon, 
while in those additionally labeled RS, the slow pion has 
the opposite charge. 

Ignoring the correlations between entries 3,5 and 4,6 in 
Table UTTl we estimate the consistency between the data 
and MC samples by calculating a summed x 2 for all the 
entries: 
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X 2 (data, MC) = ^ 



„data 



„MC\2 



(af ata ) 2 + (a 



11.4 (3) 



The value x 2 = 11 -4 is consistent with 1 per degree 
of freedom. Taken together, these observations indicate 
that the MC estimate for the background rate in the sig- 
nal region of the WS sample is reasonably accurate. We 
conservatively assign the largest discrepancy between the 
data and MC rates, 50%, as the systematic uncertainty 
associated with the ratio between the MC estimate of the 
background rate and its true value. 

To determine confidence intervals for the number of 
WS mixed events, we adapt a suggestion made in Ref. 
[l||. The complete statistical procedure is described in 
detail in the appendix; it is summarized here. We start 
with a likelihood function, £(n,nb', s, b), for the number 
of events observed in the signal region of the WS data 
sample, n, and the corresponding number observed in 
the MC sample, n&. C(n, n^; s, b) depends upon the true 
signal rate s and the true background rate b in the signal 
region, and also accounts for the systematic uncertainty 
in the ratio of the true background rate in data to that 
estimated from MC. The value of (s,b) which maximizes 
the likelihood function, £ max , is denoted by (s, b). As one 
expects naively, b is equal to rif, times the ratio of data 
and MC luminosities while s = n — b. We then search for 
the values of s where — hxC(s) changes by 0.50 [1.35]; here 
C{s) denotes the likelihood at s maximized with respect 
to b. The lower and upper values of s which satisfy this 



condition define the nominal 68% [90%] confidence inter- 
val for s. As discussed in the appendix, for the range 
of parameters relevant for this analysis, the confidence 
intervals produced by this procedure provide frcquentist 
coverage which is accurate within a few percent. 



F. Final Results and Conclusion 

We observe 3 candidates for D° — D° mixing, compared 
to 2.85 expected background events, where we ascribe a 
50% systematic uncertainty to this expected background 
rate. We find the central value for the number of WS sig- 
nal events to be 0.15, with 68% and 90% confidence in- 
tervals (—2.2,2.8) and (—5.2,4.7), respectively. Account- 
ing for the ratio of WS and RS signal efficiencies due to 
the cut on the measured WS decay time (0.80 ± 0.02), 
we find the central value of r m i X to be 0.4 x 10 -4 , with 
68% and 90% confidence intervals (-5.6, 7.4) x 10" 4 and 
(—13,12) x 10~ 4 , respectively. We ignore variations in 
the RS yield due to statistical error and systematic ef- 
fects in the RS fit as they are negligible relative to the 
statistical errors associated with the WS data and MC 
rates as well as the 50% systematic error assigned to the 
ratio of MC and data WS background rates. 

The sensitivity of this double-tag analysis is compa- 
rable to that expected for a single-tag analysis, see Ta- 
ble HI Future analyses should be able to combine these 
two approaches to significantly improve overall sensitiv- 
ity to charm mixing using scmilcptonic final states. Im- 
proved methods for reconstructing and selecting scmilcp- 
tonic signal candidates, the use of more hadronic tag- 
ging modes, and the additional use of semi-muonic decay 
modes may allow semileptonic charm mixing analyses to 
approach the r m j X sensitivity of analyses using hadronic 
final states. 
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TABLE III: Comparison of MC and data background yields. The doubly-charged (DC) MC entries refer to MC event samples 
disjoint from those used to optimize event selection. The "kinematic selection" refers to the double-tag kinematic selection. 



Entry 


Data Sample 


AM Range (GeV/c^) 


kinematic selection 


Data 


MC 


1 


WS, tuning MC 


0.25 < AM < 0.35 


no 


2± 1.4 


2.1 ± 1.5 


2 


Wb, unbiased MC 


0.25 < AM < 0.35 


no 


2 ± 1.4 


3.4 ± 1.4 


3 


DC, RS 


AM < 0.20 


yes 


37 ±6 


40 ±5.1 


4 


DC, WS 


AM < 0.20 


yes 


36 ±6 


51 ±5.8 


5 


DC, RS 


AM < 0.20 


no 


42 ± 7 


47 ±5.5 


6 


DC, WS 


AM < 0.20 


no 


55 ±8 


64 ± 6.5 


7 


DC, RS 


0.20 < AM < 0.25 


no 


20 ±5 


24 ±3.9 


8 


DC, WS 


0.20 < AM < 0.25 


no 


13 ±4 


19± 3.5 


9 


DC, RS 


0.25 < AM < 0.35 


no 


20 ±5 


31 ±4.5 


10 


DC, WS 


0.25 < AM < 0.35 


no 


23 ±5 


18 ±3.4 
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APPENDIX A: APPENDIX: STATISTICAL 
METHOD FOR ESTABLISHING CONFIDENCE 
INTERVALS 

To estimate confidence intervals for the number of WS 
signal events, we adapt a method suggested in Ref. [ill . 
For a true signal rate, s, and background rate, b, in the 
WS signal region, we determine probability density func- 
tions (PDF's) for the number of background events we 
should observe in our Monte Carlo simulation, rib, an d 
the number of candidates we should observe in the WS 
signal region, n. We use these to define a global likeli- 
hood function for s and b which depends upon n and n^: 
C(s,b;n,nb). Given an observation (n, nt), the central 
value for s is that which maximizes C(s, 6; n, rib). The 
boundaries of confidence intervals for s are then defined 
by the extremum signal rates in the (s, b) plane where the 
logarithm of the likelihood function changes by specified 
values based on those that would provide proper frequcn- 
tist coverage in the limit of high statistics and Gaussian 
distributions. Assuming the PDF's we use are correct, 
we validate this algorithm by checking the frequentist 
coverage it produces for a range of values of s and b. 

The PDF for rib is taken to be 

P(n b ;b MC )= (Al) 

f°° X nb 1 1 (i>MC-') a 

N{b M c) / — t^—t, re 2 " (l »rf 2 dx . 

J n b \ d(t)Mc) 

In this equation, rib and b are as defined above; &mc = 
ab is the mean number of events expected in a Monte 
Carlo simulation with a times the luminosity of the data 
sample, and <t(&mc) = 0.5 &mc accounts for the 50% sys- 
tematic uncertainty in the ratio of background rate in the 



12 



data and background rate in the MC simulation. iV(&Mc) 
is the normalization such that X^=o P( n b'> &mc) = 1- 

The PDF for the combined observation (n, rib) is then 
taken to be the product of P{rib] &mc) and a purely Pois- 
son term for n: 



• We let si be the minimum value of s in this set 
and s u be the maximum value. The 90% (68%) 
confidence intervals for s are the ranges (s;, s u ). 



P{n-n h )= (A2) 
P(n b ;b MC = a b)x { -^±P^e^ s+b K 

To obtain 90% (68%) confidence intervals, we use the 
following procedure. 

• We write the global likelihood function for s and b 

as 



£{s,b;n,n b ) = 
P(nb; bMC = ab) x 



(A3) 



(s + by 



-(s+b) 



III 



• Wc find the values (s, b) for parameters (s, b) which 
maximize the likelihood. These are b = rib /a and 
s = n — b. 

• With C max the value of the likelihood at its maxi- 
mum, we obtain a 90% (68%) confidence intervals 
for s, by finding the points in the (s, b) plane where 

Aln£ = lmCmax - hi£(s, b; n, n b ) (A4) 
= 1.35 (0.50). 



We have determined the frequcntist coverage of this 
algorithm for many values of (s, b) by using the PDF of 
Eqn. (|A3j) to generate large samples of (n,rib). For any 
one (s,6), we consider the ensemble of all (n,rib) gener- 
ated. For each of these (n, n b ) we determine whether s is 
contained in the 90% (68%) confidence interval defined 
using the algorithm described above. The fraction of all 
(n, rib) containing the true value s is called the coverage. 
The coverage is therefore a function of both s and b as 
well as the the level (68% or 90%). As a function of s be- 
tween and 10, and for fixed values of b between 2.5 and 7 
(where 2.85 is the central value "expected" based on our 
observation of n b ), the coverages we calculate are close to 
the nominal values. In this range, the 68% intervals pro- 
vide coverages between 64% and 72% with the most se- 
vere undercoverage observed for s < 2; the 90% intervals 
provide coverage between 87% and 92%s with the most 
severe undercoverage again observed for s < 2. The de- 
viations from nominal coverage are relatively small. We 
judge the statistical properties of the quoted intervals to 
be sufficiently accurate for this analysis. 
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